LARGE DEVIATIONS FOR GENERAL MARKOVIAN HAWKES 

PROCESSES 



LINGJIONG ZHU 

Abstract. In the 2007 paper, Bordenave and Torrisi [1] proves the large de- 
viation principles for Poisson cluster processes and in particular, the linear 
Hawkes processes. In this paper, we prove first a large deviation principle for 
a special class of nonlinear Hawkes process, i.e. a Markovian Hawkes process 
with nonlinear rate and exponential exciting function, and then generalize it 
to get the result for sum of exponentials exciting functions. We then provide 
an alternative proof for the large deviation principle for linear Hawkes process. 
Finally, we use an approximation approach to prove the large deviation prin- 
ciple for a special class of nonlinear Hawkes processes with general exciting 
functions. 
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1. Introduction 



We consider a point process with rate A(-). For the Poisson process, A(-) is a 
constant. For a renewal process, the independence of the intervals between suc- 
cessive events implies that A(-) involves Ft only through the time from t back to 
the point, if any, immediately preceding t, where Tt denotes the history of the 
process up to time t. It is natural to build up simple processes by supposing A(-) 
to involve Tt in specific ways. One possibility is to suppose that each point in the 
past infiuences A(-) in a way decaying in time and that contributions from distinct 
points add together. Let us consider the stochastic process {Zt,Nt), where Nt has 
Poisson rate X{Zt) at time t and A is a positive function with certain regularities. 
Zt is defined as 



where Tj is the j-th arrival time of Nt for j > 1. The exciting function h is positive, 
integrable and continuous. 

This is known as the general Hawkes process. When the exciting function h{-) 
is exponential or a sum of exponentials, the process is Markovian and we name it 
the general Markovian Hawkes process. 

When A(-) is linear, this is known as the (linear) Hawkes process, which was 
introduced in Hawkes [9]. If A(-) is linear and h{-) is exponential or a sum of 
exponentials, the (linear) Markovian Hawkes process is sometimes referred to as 
Markovian self-exciting processes, see for example Oakes [14]. You can think of the 
arrival times Tj as "bad" events, which can be the arrivals of claims in insurance 
literature or the time of defaults of big firms in the real world. Hawkes process 
captures both the self-exciting property and the clustering effect, which explains 
why it has wide applications in cosmology, ecology, epidemiology, seismological 
applications, neuroscience applications and DNA modeling. For a list of references 
to these applications, see Bordenave and Torrisi [T]. 

Hawkes process has also been applied in finance. Empirical comparisons suggest 
that Hawkes processes have some of the typical characteristics of financial time 
series. Financial data have been analyzed using Hawkes processes. Self-exciting 
processes are used for the calculation of conditional risk measures, such as the Value- 
at-Risk. Another area of finance where Hawkes processes have been considered is 
credit default modeling. Hawkes processes have been proposed as models for the 
arrival of company defaults in a bond portfolio. For a list of references to the 
applications in finance, see Liniger [13) . 

For a short history of Hawkes process, we refer to Liniger ^Sj. For a survey 
on Hawkes processes and related self-exciting processes, Poisson cluster processes, 
marked point processes etc., we refer to Daley and Vere- Jones [i]. 

When A(-) is linear, say A(z) = v + z, then one can use immigration-birth repre- 
sentation, also known as Galton- Watson theory to study it. Under the immigration- 
birth representation, if the immigrants are distributed as Poisson process with in- 
tensity v and each immigrant generates a cluster whose number of points is denoted 
by S, then Nt is the total number of points generated in the clusters up to time t. 
If the process is ergodic, we have 



(1.1) 




(1.2) 



t—^'OO t 



1\ + 

lim — = j/E[5'], 



a.s. 
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Bordenave and Torrisi proves that if < /i = h{t)dt < 1 and th[t)dt < 
oo, then (-^ G •) satisfies the Large Deviation Principle with the good rate function 

(1.3) = + if [0,00)^ 

I +00 otherwise 

where 9 — 9x is the unique solution in {—oo, fi — 1 — log/i) of £[6^"^] = pq^, 
X > 0. It is well known that, for instance, see page 39 of Jagers [TT], for all 

9 e (— oo, iJ, — 1 — log/i), E[e^"^] satisfies 

(1.4) E[e^^] = e« exp {M(E[e^^] - 1)} . 

Once the LDP for G •) is estabhshed, it is easy to study the ruin probability. 
Stabile and Torrisi [TS] considered risk processes with non-stationary Hawkes claims 
arrivals and studied the asymptotic behavior of infinite and finite horizon ruin 
probabilities under light-tailed conditions on the claims. 

We are interested in general non-linear A(-). If A(-) is non-linear, then the usual 
Galton- Watson theory approach no longer works. If the exciting function h is 
exponential or a sum of exponentials, the process is Markovian and there exists 
a generator of the process. The difficulty arises when h is not exponential or a 
sum of exponentials in which case the process is non-Markovian. Another possible 
generalization is to consider h to be random. Then, we will get a marked point 
process. (For a discussion on marked point processes, see Cox [3].) 

When A(-) is nonlinear, Bremaud and Massoulie [5] proves that under certain 
conditions, there exists a unique stationary version of the nonlinear Hawkes process 
and Bremaud and Massoulie also proves the convergence to equilibrium of a 
nonstationary version, both in distribution and in variation. 

In this paper, we will prove the large deviation when h is exponential and A is 
non-linear first. Then, we will generalize the proof to the case when /i is a sum 
of exponentials. We will use that to recover the result proved in Bordenave and 
Torrisi 1 . Finally, we will prove the result for a special class of non-linear A and 
general h. 

Before we proceed, recall that a sequence {Pn)neN of probability measures on a 
topological space X satisfies the large deviation principle (LDP) with rate function 
/ ; X — > M if / is non-negative, lower semicontinuous and for any measurable set 
A, 

(1.5) - inf I{x) < \immi- log Pn{A) < lim sup i log P„ ( A) < - inf I{x). 

x£A° Ti-s-oo n n^oo n xeA 

Here, A° is the interior of A and A is its closure. See Dembo and Zeitouni l5| for 
general background regarding large deviations and the applications. Also Varadhan 
[16) has an excellent survey article on this subject. 



2. An Ergodic Lemma 

Let us prove an ergodic theorem first. Assume h{t) = Ylt^i ■ Here 
hi > 0, ai might be negative but we assume that h{t) > for any t > 0. Zt = 
J2r <t Ht - Tj) = Eti Z^{t), where Z,{t) = J2r <t a*e-'''(*-"^). The generator A 
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for {Zi{t), . . . , Zd{t)) is given by 

(2.1) Af = ^^^^i^i-^ ^ H^i, ■ ■■ , Zd)[f{zi + ai, . . . , Zd + ad) ~ f{zi, .. ., Zd)]. 

T ^Zi 

We want to prove the existence and uniqueness of the invariant probabiUty mea- 
sure 

The lecture notes [8] by Martin Hairer gives the criterion for the existence and 
uniqueness of the invariant probability measure for Markov processes. 

Suppose we have a jump diffusion process with generator C. If we can find u 
such that u > 0, Cu < Ci — C2U for some constants Ci, C2 > 0, then, there exists 
an invariant probability measure. We thereby have the following lemma. 

Lemma 1. Consider h{t) = X^iLi ajC^^'* > 0. Let Ci — +1 if a.i > and — —1 if 
a.i < 0. Assume A(zi, . . . , z^) < X^iLi Q^ikil+Z^; where (3 > and ai > 0, 1 < i < d, 
satisfies X^iLi ^T^'^i ^ 1- Then, there exists a unique invariant probability measure 
for iZ,it),...,Zdit)). 

Proof. Try u{zi, . . . , Zd) — X^iLi ^iCiZi > 0, where q > 0, 1 < i < d. Then, 
d d 

(2.2) ylw = - ^ htidZi + A(zi , . . . , Zd) ^ aiEid 

i=l i=l 
d d d d 

2—1 i—1 i—1 2—1 

Take q = ^ > 0, we get 

(2.3) < - f 1 - E ^) E + /3 E ¥^ 

\ i=i * / (=1 i=i * 

< - mm hi - \ I ~ } — \ u + p y — . 

\ i—i / i—i 

Next, we will prove the uniqueness of the invariant probability measure. Let 
us consider the simplest case h{t) = ae~^*. To get the uniqueness of the invariant 
probability measure, it is sufficient to prove that for any x,y > 0, there exists some 
T > and < a < 1 such that 

(2.4) ||7'^(T,.)-P^(T,.)||Ty <2(l-a). 

Let us assume that x > y > 0. Conditional on the event that Zf and Z^ have 
exactly one jump during the time interval (0,T) respectively, the law of V^{T, •) 
and V^lT,-) are absolutely continuous with respect to some probability measures 
with positive density on the sets 

(2.5) ((a + a;)e~^'^,xe"''^ + a) and {{a + y)e-'''^ ,ye-''^ + a) 
respectively. Choose T > i log(^^^^i^), we have 

(2.6) ((a + x)e-'^, xe^'^ + a) f] ((a + y)e''^ , ye-"^ + a) ^ 0, 
which implies that V^{T, •) and V^(T, •) are not mutually singular. 
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Similarly, wc can prove the uniqueness of the invariant probability measure for 
the multidimensional case. We need to conditional on the event that we have exactly 
d jumps during the time interval (0, T) for both Zf and , where x,y € Z^. Then, 

(2.7) 

where Zf^ = XiC' 



+ ELiOie-''-^*"^^), l<i<d. Then, 7'^(r,-) and Vy{T,-) 



are not mutually singular for sufficiently large T. 



□ 



3. LDP FOR General Markovian Hawkes Processes with Exponential 

Exciting Function 

We assume first that h{t) = ae~^*, where a,b > 0, i.e. the process Z^ jumps 

upwards an amount a at each point and decays exponentially between points with 
rate b. In this case, Zt is Markovian and {Zt,Nt) is a "Self-Exciting Shot Noise 
Process" . 

Notice first that Zq = Nq = and 

(3.1) dZt = -bZtdt + adNt, 

which implies that Nt = -^Zf + ^ Zgds. 

We prove first the existence of the limit of the logarithmic moment generating 
function of Nt. 

Theorem 2. Assume that lim^^cx) = and A(-) is hounded below by some 
positive constant, then, we have 

(3.2) lim -logE[e^^*] =r(^), 

t— >CX) t 

where T{6) is defined as 

(3.3) T{e)= sup \ f —z^dz)+ [{X- X)w{dz)- [ (logiX/X))XTT{dz)\ , 

where Qe is defined as 

(3.4) Qe = ^{X, tt) G Q : A has unique invariant probability measure ^| , 
where 



(3.5) 



Q=l^{X,w):7reM{R+),J ^tt < oo, A e i^(7r)| . 



Proof. Note that for any real 6, we have E[e^^*] < oo and 



(3.6) 



By Dynkin's Formula, for any u which satisfies Au + Vu < Mu, we have 
(3.7) 



E 



E 



{Au{Z,) + V{Z,)u{Z,))efo nz.)dv 



ds 



< u{Zo) + M [ E \u{Zs)e^o v(Zv)dv 
Jo '- 



ds, 



which implies by Gronwall's Lemma that 



(3.8) 



E 



u(Zt)e/o ^(^3)<i«l < u{Zo)e^' = u(0)e^*. 



LINGJIONG ZHU 



In our case, V{z) = ^z. Now for any u{z) > cie»^, we have 



(3.9) 



,ONt 



< 



Cl 



-E 



J Cl 



Mt 



Therefore, we get 



1 r fiMi Au(z) + —zuiz) 

3.10) Umsup-fogE e^^* < in£ sup ' °: —, 



t—^'OO 



where 
(3.11) 



= |w e T^iA) : u{z) > Cie^"", for some ci > o| . 



Define the tilted generator A as 



(3.12) 



Af{z) = + Xiz)[f{z + a) - /(z)]. 



Then, the tilted probability measure P is given by 



(3.13) 
Now, we have 



dP 
dP 



= exp^ / {X{Z,)-\{Z,))ds+ / log 



KZs) 



dNs>. 



(3.14) Uminf-logEfe^^'l 

t— >oo t 



lim inf - log E 

t— >-oo t 



oxp^iV,-log- 



> lim inf E 

t— >-CSO 



HminfE 



-6'iVt log — 

t t ^ dF 



loNt --1 {\{Zs) - \{Zs))ds - I log 
t t Jo Jo 



XiZs) 



dm 



Since Nt — /q X{Zs)ds is a (local) martingale under P, we have 



(3.15) 



E 



/ 

Jo 



log 



(dN, - XiZ,)ds) 



= 0. 



X{Z,) 

Therefore, by Ergodic Theorem, for any (A, ^) G Qe, we have 



(3.16) liminf ilogEfe' 



> lim inf E 



jONt - - I (A(Z,) - X{Z,))ds - I log 



XjZs 
X{Zs 



X{Zs)ds 



= J^ZTt{dz) + jCX- X)TT{dz)- j (log(A) - log(A)) X7t{dz). 
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Hence, we have 
(3.17) 



liminf -logE[e^^'] 

t— >oo t 



> sup 
(A,*)eee 



J^zn + J{X-X)^- J (log(A) - log(A)) Att 



Define 



(3.18) T = <^f : f{z) = Kz + 5(2), K > -, g \a Ci with compact support \ . 
It is easy to check that 



(3.19) 



Therefore 



inf 

fey" 

have 
1 



j Af{z)%{dz) 



if(A,7r)eQe, 
-00 if (A, tt) e Q\Qe. 



(3.20) liminf -logE[e'*^*] > sup inf 



(3.21) 



> sup inf 



zn-H{X,n)+ / Afn}, 



where TZ = {(Att, tt) : (A, tt) G Q} and 



(3.22) 
Define 



^(A, tt) = H{7r) = y [(A - A) + log (a/a) 



eb 



(3.23) F(A7r,7r,/) = J — zn - H {X, n) + j Afi: 

= J^zn-HCx,rr)- J bz^^ + j {f {z + a) - f {z))Xn . 
Notice that F is Unear in / and hence convex in / and also 



(3.24) 



A/ + A(l-e 



H{X^'k) = sup 

/eC6(R+) 

where C(,(M"'") denotes the set of bounded functions on M+. Inside the bracket 
above, it is linear in both tt and A^-. Hence H is weakly lower semicontinuous 
and convex in (Att,?!"). Therefore, F is concave in (A7r,7r). Furthermore, for any 
f = Kz + geT, 

(3.25) F(A7r, tt, /) = j bzn - ^(A, t^) - J bz^w 

{g{z + a) — g{z))XTT + Ka I Att. 



If XnTTn loo and 7r„ — > TToo weakly, then, since g is Ci with compact support, we 
have 



(3.26) 



~ j bz^T^n + j ig{z + a) - g{z))X„-jr„ + Ka J Xn-Wn 
- J bz^'^00 + J {g{z + a) - g{z))joo + j 
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as 71 — >■ oo. Moreover, in general, if Pn — > P weakly, then, for any / which is upper 
semicontinuous and bounded from above, we have limsup,j / fdPn < J fdP. Since 
(- — K) bz is continuous and nonpositive on R+, we have 



(3.27) 



lim sup 



— K ] bzTTn < 



— K bzTTn 



Hence, we conclude that F is upper semicontinuous in the weak topology. 

In order to switch the supremum and infimum in p.2ip . since we have already 
proved that F is concave, upper semicontinuous in (Att, vr) and convex in /, it is 
sufficient to prove the compactness of TZ to apply Ky Fan's Minmax Theorem (see 
[B]). Indeed, Joo developed some level set method and proved that it is sufficient to 
show the compactness of the level set (see [T^ and [7]). In other words, it suffices 
to prove that, for any C € M and f G T, the level set 



(3.28) 



(Att, TT)en:H+ I bz^ir - — ztt - A[/(z + a) - f(z)]TT < C 
az a 



is compact. 

For any / — Kz + g G J^, where K > ^ and g is Ci with compact support, there 
exists some Ci , C2 > such that 
(3.29) 

Ci>H+(K-^-\b j ZTT^C2 I Att 



> 



'x>cz+e 
C2 



A- A + Alog(A/A) 



K ] b I ZTT 

a , 



Att — C2 Att 

\>cz+e Jx<cz+e 



> 



mm log — — 1 - 62 

z>o A(z) 



Att + 



x>cz+e 



-c-C2+\K 



ZTI-IC2. 



We choose the parameters such that < c < ~ f ) ^ and £ large enough such 



cz+l 
\(z) 

Hence, we proved that 
(3.30) 
where 

(3.31) C3 



that min2>o log ^^^y — 1 — C2 > 0, where we used the fact that lim^ 



zA < C3, / Att < C4, 

\>cz+e 



X(z) 



Ci + IC2 



Ci + IC2 



min^>o log ^ - 1 - G 



Att + / A^ < C4 + c • C3 + £, 

\>cz+t Jx<cz+e 



-c-C2 + {K-^)b 

Therefore, we have 

(3.32) / ^'^ " 
and hence 

(3.33) H{\, ^) < Ci + C2 [C4 + c • C3 + £] < 00. 
Therefore, for any (A„7r„,7r„) e 7?., we get 

(3.34) lim sup ( 7r„ < lim sup ^ [ Z7r„ < lim ^ = 0, 
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which miphcs the tightness of 7r„. By Prokhorov's Theorem, there exists a sub- 
sequence of TTn which converges weakly to ttoo. We also want to show that there 
exists some 700 such that A„7r„ — > 700 weakly (passing to a subsequence if neces- 
sary). Here, 7r„ may be a subsequence of the original sequence if necessary. It is 
enough to show that 

(i) sup„/ A„7r„ < 00. 

(ii) lim^^oo sup„ A„7r„ = 0. 

(i) and (ii) will give us tightness of XnT^n and hence implies the weak convergence 
for a subsequence. 

Now, let us prove statements (i) and (ii). 
To prove (i), notice that 



(3.35) sup / A„7r„ — sup / — Z7r„ < — [C4 + c ■ C3 + i] < 00. 

n J n J a a 

To prove (ii), notice that (A — A„) -I- A„ log(A„/A) > by Remark[6l Notice that 

lim sup / A„7r„. 



(3.36) lim sup / A„7r„ < lim sup / A„7r. 

f-i-oo „ Jz>i " ' 



" "'A„>VAz,z>£ 



For the first term, since sup„ / 2;7r„ < 00 and lim2_j.oo = 0, 
(3.37) lim sup / A„7r„ < lim sup / "\/Az7r„ = 0. 



For the second term, since limsup^_j.g2 = 0, 



(3.38) lim sup / 

< lim sup / if(A„,7r„) sup x x , \ 1 m /xx 

f-foo „ JA„>yAl,z><' A„>VaI,z>£ ^ - + -^n iOg(A„/Aj 



An7r„ 

'A„>x/Az,2>^ 

An 



Therefore, passing to some subsequence if necessary, we have A„7r„ — > 700 and 
7r„ — >■ TToo weakly. Since we proved that F is upper semicontinuous in the weak 
topology, hence the level set is compact in the weak topology. Therefore, we can 
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switch the supremum and infimum in p.21|) and get 
(3.39) hminf ilogE [e^^*] 



(3.40) 
(3.41) 
(3.42) 
(3.43) 
(3.44) 



t-i-oo t 

> inf sup sup 



9b 



ZTT + (A - A)^ - log(A/A)A^ + Af-K 



= inf sup 



— + A(z)(e/(^+")-^W-l)-fez^ 
a oz 



TT{dz) 



inf sup 



inf sup 
/e-^ z>o 



+ A(z)(e-^(^+"'^^(^) -l)-6z 



df 

dz 



^ . , .' Au 9b 
> mi sup < \ z 



We need some justifications. Define G(A) = A — log(A/A)A + Af. The supremum 
of G{X) is achieved when ^ — which implies A = Xef(^+°-'>^f(^'>. Notice that for 

a given f £ T, the optimal A = \Qfi^+°-)-fi^) satisfies f Xn < oo since J Att < oo 

and / ZTT < oo. This gives us p.4ip . Notice that for any / = Kz + g E 

(3.45) 

— + A(z)(e/(^+")-/(-) - 1) - bz^ = b{9-Ka) ^ ^ ^^^-^^^Ka+giz+aJ-giz) _ 



dz 



whose supremum is achieved at some finite z* > since lim^— ^oo — — and 
K > ^. Hence J ztt < oo is satsified for the optimal tt. This gives us p.42p . 
Finally, for any f E J-, u = e-^ G Ug, which implies p. 441) . □ 



Now, we are ready to prove the large deviations result. 

Theorem 3. Assume lim2_>.co ^^^^ = and A(-) is bounded below by some positive 
constant. Then, (^ € •) satisfies the large deviation principle with the rate function 
/(•) as the Fenchel-Legendre transform ofT(-), 

(3.46) lix) ^ sup {9x~T{9)}. 

Proof. If limsup2_j.oo ^^T^ — Oi then Lemma [5] implies that T{9) < oo for any 9. 
Thus, by Gartner-Ellis Theorem, we have the upper bound. For Gartner-Ellis 
Theorem and a general theory of large deviations, see for example [5]. To prove the 
lower bound, it suffices to show that for any a: > 0, e > 0, we have 

(3.47) liminf ilogP ( — e B,{x)] > - snp{ex ^ T{9)}, 

t->oo t \ t J g 

where B^{x) denotes the open ball centered at x with radius e. Let P denote 
the tilted probability measure with rate A as defined in Theorem [21 By Jensen's 
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inequality, we have 



1 



(3.48) -log¥[ — eB^{x 



>ilog 



-logP -i e B^{x) - - log 



1 



t 



> -logP — e B,{x) 



-E 



By Ergodic Theorem, we get 
(3.49) 

where A^{x) is defined as 
(3.50) 

where is defined as 



liminf i logP ( — G B,{x) ] > -A{x) 

t-s-oo t \ t 



A{x) ^ inf <^ / (A - A)^ + / log(A/A)A7r } , 

(A,*)ess 



(3.51) 

Notice that 
(3.52) r(6l) 



= (A,7r) G Se : / Xiz)7r{dz) ^ x 



sup 

(A,7r)eSe 



eXfr + J {X-X)tt- J log(A/A)A7r 
eX7r+ [CX-X)n- [ log(A/A)A7r 



= sup sup 

^ (A,#)GQ 

= sup{6'a:; — A(a;)}. 

X 

We prove in Lemma[4]that A(a::) is convex in x, identifying it as the convex conjugate 
of T{9) thus concluding the proof. □ 

Lemma 4. A(x) in (|3.50p is convex in x. 
Proof. Define 

(3.53) HCx,n)^ J {X - X)7r + Jlogi'X/XfXn. 
Then, we have 

(3.54) Mx)= inf H(A,^). 

(A,Tr)eQ5 

We want to prove that A{axi + 13x2) < aA(a;i) + l3K{x2) for any a,/? > and 
a + P = 1. For any e > 0, we can choose (AfejTrfe) G 2^*= such that H{Xk,TTk) < 
A(xfe) + e/2, for A: = 1, 2. Set 



(3.55) 



TTs = QfTTl + /37r2, A3 



aTTi 



-Ai 



ani + /37r2 aTTi + /37r2 



-A2. 
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Then, for any test function /, we have 

(3.56) j Asfns =aj Aifin + p j ia/Tra = 0, 

which implies that (AsjTTs) G Q,,- Furthermore, 



(3.57) 



Therefore, we have (A3,7r3) e Q»^^+P^\ Finally, since x log a; is a convex function 
and if we apply Jensen's inequality, we get 



(3.58) H{X3,n3)= (A - A3 - A3 log A) + A3 log Ag 



7r3 



< 



A3 - A3 log A) + aAi log Ai + /3A2 log A2 



7r3 



= aif(Ai,7ri)+/3if(A2,7r2). 

Therefore, we conclude that 
(3.59) 

A{axi+pX2) < H{X3,TC3) < aH{\i,TTi) + I3H(\2,tt2) < aA{xi) + l3A{x2) + e. 
Lemma 5. // lim sup2_j.oo < ^, then, for any 



□ 



(3.60) 



61 < log 



ahmsup^^^ ^ 



a A(2;) 
— 1 + - • limsup , 



we have r{9) < 00. //limsup^^^^ = 0, then r{9) < 00 for any 6 G 



9 k.^Tr^ „Kz 



Proof. For K > ^, we have e ' G Ug and 



(3.61) T{e) < inf sup 



Ag{z)+"-^zg{z) (Ae'^'^eb 



9{z) 
9b 



< sup 



+ —z 



>o I e^^ a 



= sup <^ - 6if ]z + A(z)(e^'' - 1) 



Define the function 
(3.62) 



z>0 



F{K) = -K + limsup ^ • (e^" - 1). 
z^oo bz 



Then F(0) = 0, F is convex and -F(ii') 00 as K ^ 00 and its minimum is 
attained at 



(3.63) 



K* = - log 

a 



o lim sup^ — ^ 



>0, 



"Z->-(X> z 

and F{K*) < 0. Therefore, r{e) < 00 for any 



(3.64) 



K>0 



< —a min <—K + lim sup 



log 



^alimsup^^,^ ^ 



bz 



a 1- A(z) 
1 + - • hmsup — ^ < if a. 

b z^oo z 
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If limsup2_^go = 0, trying e^^ G Ug for any K > f , we have T{9) < oo for any 

e. □ 

Remark 6. Notice that x — 1 — logx > for any x > 0. Hence 



(3.65) A - A + A log(A/A) = A (A/A) - 1 - log(A/A) 



> 0. 



Lemma 7. Let X{z) = a + (3z. If has a stationary limit Z, then 
(3.66) E[Z] 



b-aP' 

Proof. Let ^ be the invariant probabihty measure for A, then for any test function 
/, we have 

(3.67) jAfcP = J fA*ct> = 0. 

If wc try f{z) = z, we get Az = —bz + {a + /3z)a and therefore 

(3.68) E^iZ] = 

□ 

4. LDP FOR General Markovian Hawkes Processes with Sum of 
Exponentials Exciting Function 

Now, let hhe a sum of exponentials, i.e. h{t) = J2i=i Oie"^** and let 
(4.1) Zi{t) = J2 a»e-''*(*-^^\ l<i<d, 

Tj<t 

and Zf = Y^i^iZi{t) = '^T.^fh{t — Tj). It is easy to see that {Zi, . . . , Zd) is 
Markovian in M"^ and its generator is given by 

{zi +ai,...,Zd + ad) - f{zi, Zd)]. 



(4.2) ^/ = -^6iZ,|^ + A[^z,] •[/(. 



bi > for any 1 < i < d. But can be negative, as long as h{t) = J2i=i > 0- 

In particular, h{0) — X^iLi ^^i > 0- If '^i > 0) then Zi{t) > almost surely; if aj < 0, 
then Zi{t) < almost surely. 

Theorem 8. Assume lim2^.(x) = 0. Then, 

(4.3) lim ilogEfe^^'l = inf sup {— + —1^ V b^zA , 

where 



(4.4) Zx^e = |u e Ci{R'^),u > cie ^f=i , for some ci > 0^ 

and Z = {(-21, . ■ - ^Zd) ■ aiZi > 0, 1 < i < d}. 
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Proof. Notice that 

(4.5) dZ,{t) = -b,Z,{t)dt + UidNt, l<i<d. 

Hence, aiNi = Zi{t) — Zi{()) + hiZi{s)ds and 



(4.6) E[e''^*] = E 



exp ■ 



i=i«i Ei=ia»-^o 
Therefore, by Feynman-Kac formula, we obtain the upper bound 



t d 

biZi{s)ds 



(4.7) limsupilogE[e^^*] < inf sup 
As before, we can obtain the lower bound 



Au e 



1 



(4.8) liminf-logE[e' 



eNt] 



> sup 
(A,#)eQe 



> sup inf 
= sup inf 



0\- \ + '\\og{x/\^ TT{dzi,...,dzd) 
0X - X + X- X\og(^X/xj +Ag tt 



(A,#)eQ 



Ed 
i=l«i 



A + A - Alo! 



:(A/A)+i/ 



The last line above is by taking f = g -\- ^^=^ ' G for ^ € ^, where 
(4.9) e= |^ifei^i+5,K>0 , (? is Ci with compact support > . 



Here, Cj = aj/|aj|, 1 < i < rf. Define 



(4.10) 



i^(A7r,7r,/) = 



/ 



TT-H{X,7r). 



F is linear in / and hence convex in /. Also H is weakly lower semicontinuous 
and convex in (A-n-,7r). Therefore, F is concave in (A-^jTt). Furthermore, for any 

/ = + Eti -^eiz. + 5 e .F, 



(4.11) F(A7r,7r,/) = 



i=l 



Xtt — J y^ KeibiZiTf- — H{X, tt) + / Agn. 

i—l 



If X„Tr„ — >■ 7oo and 7r„ — >■ tToo weakly, then, since g is Ci with compact support, we 
have 



(4.12) 



/ 



1=1 



An7r„+ / ^57r„ 



7oo + / ^ffTToo- 



Since — Ei=i KcihiZi is continuous and nonpositive on .E, we have 



(4.13) 



limsup / 

n— )-cso J 



i=l 



1=1 
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Hence, we conclude that F is upper semicontinuous in the weak topology. 

In order to apply the minmax theorem, we want to prove the compactness of the 
level set in the weak topology 



(4.14) 



(A7r,7r) 



Ed 



Af 



H{X,7t)<C 



For any / = ^'=^ ' + Yl,i=i ^^i^i + 9 & where g is Ci with compact support 
etc., there exist some Ci, C2 > such that 



(4.15) Ci>H + Y^ Khe^ j z.tt - C2 j Att 

d 

A- A + Alog(A/A) 7r + ^ 

Z — 1 

Att 



> 

-C2 



> 



Att-Cs 

cizi H h CdZd + i „ 

mm log — — 1 - 02 

izi,...,za)ez A(zi H + Zd) 



Att 



d „ 



If tti > 0, then €i > 0, pick up q > such that — q • C2 + -ftT^iCi > 0. If < 0, then 
< 0, pick up Ci such that — q • C2 + i^^iCi < 0. Finally, choose £ big enough such 
that the big bracket above is positive. Therefore, we have 



(4.16) 



U,:I^<C3, / A7r<C4. 



Hence, / A-S" < C5 and H < Cq. We can use the similar method as in the proof of 
Theorem [5] to show that 



(4.17) 



lim sup / A„7r„ = 0, 1 <i < d. 



\zA>t 
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For any (A„7r„,7r„) £ TZ, we can find a subsequence that converges in the weak 
topology by Prokhorov's Theorem. Therefore, 

(4.18) 



liminf-logE[e^^*] 

t^oo t 



> sup inf 



(A,#)gQ 



inf sup sup 



^ A 



Ed 
,=1 

Ed 
»=i 



A + A- Alog(^A/Aj +Af 
-A + A- Alog (a/a) +i/ 



inf^ sup ^gjr'^'^' + A(e/(^^+'^--^-+'^-)--^(^--'^-) " 1) - E ^^^^ ^'^^ 



> ml sup < — 



92;,; 



^1 i=l 



That is because optimizing over A, we get A = Ae-^'^^^"''''^' ' '^''"*"''''' f{zi,---,zd) g^j^(j 
finally for each f E J-, u — E Uq. □ 

Theorem 9. Assume lim^-yoo ~ a«rf A(-) is bounded below by some positive 
constant. Then, (^ G •) satisfies the large deviation principle with the rate function 
/(•) as the Fenchel-Legendre transform ofT(-), 



(4.19) 

where 
(4.20) 



I{x) = sup {61a; -r (61)}, 

0eR 



r{9) = sup 

(A,7r)eQe 



6'A- A + A- Alog (^A/A 
Proof. The proof is the same as in the case of exponential h{-) 



□ 



5. LDP FOR Linear Hawkes Processes: An Alternative Proof 

In this section, we use our method to recover the result proved in Bordenave and 
Torrisi [T] . We prove the existence of the limit of logarithmic moment generating 
function first. The strategy is to use the tilting method to prove the lower bound. 
That requires an ergodic lemma, which we state as Lemma [TUl For the upper 
bound, we can opitimize over a special class of testing functions for the linear rate 
with sum of exponential exciting function Any continuous and integrable h 
can be approximated by a sequence h„. By a coupling argument, we can use that 
to approximate the upper bound for the logarithmic moment generating function 
when the exciting function is h. Finally, by tilting argument for the lower bound 
and Gartner-Ellis theorem for the upper bound, we can prove the large deviations 
for the linear Hawkes processes. 

Lemma 10. Assume \{z) ^ a + (3z and /i = h[t)dt < oo. // /3/i < 1, then 
there exists a stationary and ergodic probability measure n for Zt and J zir — jz^- 
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Proof. The ergodicity is a well known result lor linear Hawkes process. (See Hawkes 
and Oakes jl^.) Let n be the invariant probability measure lor Zt, then 



(5.1) 



lim — - 

t^oo t 



\{z)iT{dz) = a + (3 ZTT{dz). 



II Zt is invariant in taking expectations to Zt = X*_^ h{t — s)dNs 



(5.2) E[Zt] = j zTT{dz 
which implies that J zn 



A(z)7r(dz) / h{t — s)ds — fi / X{z)TT{dz), 



□ 



Lemma 11. // h{t) > 0, h{t)dt < oo, h{oo) = and h is continuous, then h 
can be approximated by a sum of exponentials both in and L°° norms. 

Proof. The Stone- Weierstrass Theorem says that ilJf is a compact HausdorlF space 
and suppose A is a subspace ol C{X) with the following properties, (i) II /, g G A, 
then f X g Q A. (ii) 1 G A. (iii) If x,y £ X then we can find an / G A such that 
f{x) ^ f{y). Then A is dense in C{X) in L°° norm. Consider X = R+ U {oo} = 
[0, oo] and C[0, oo] consists ol continuous lunctions vanishing at oo and the constant 
function 1. 

By Stone- Weierstrass Theorem, the linear combination ol 1, e^*, e~^* etc. is 
dense in C[0,oo]. In other words, lor any continuous lunction h on C[0, oo], we 
have 



(5.3) 



sup 

t>0 



hit) -Y^a.e-'' 

3=0 



< e. 



In lact, since h{oo) = 0, we get |ao| < e. Thus 



(5.4) 



sup 

t>0 



h{t) 



< 2e. 



However, X]j=i ^j^ may not be positive. We can approximate h(t) first by 
a sum of exponentials and then approximate h{t) by the square ol that sum of 
exponentials, which is again a sum ol exponentials but positive this time. 

Indeed, we can approximate h{t) by the sum of exponentials in norm as well. 
Suppose 11^— hn\\L=° 0, where /i„ is a sum ol exponentials. Then, by Dominated 
Convergence Theorem, lor any 5 > 0, J \h — hn\e~^^dt — >■ as n — oo. Thus, we 
can find a sequence (5„ > such that (5„ ^ as n — >■ oo and ^ \h~ hn\e^^"'^dt — >■ 0. 
By Dominated Convergence Theorem again, / h{l - e-^"*)dt 0. Hence, we have 



J\h- h„ 



-s„t 



\dt ^ as n oo, where h„e " is a sum ol exponentials 



We will show that /i„ e 
(5.5) \\h 



hne-'"'\\L 



* converges to h in L°° as well 

•o < \\h — hn\\L=° + \\h' 



hne ''"*||l= 



Notice that (1 - e-'^"*)/i„ < (1 



){h{t) + e). Since h{oo) = 0, there exists some 



M > 0, such that lor t> M, h{t) < e so that (1 - e^^"'^)h„ < 2e lor t > M. For 
t<M, {I- e-^^^'^)hn < (1 - e-^^^'){\\h\\L^ + e) which is small il (5„ is small. □ 
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Theorem 12. Assume X{z) — ly + z, h > and h{t)dt < oo. We have 
(5.6) lim - logE[e''^'] = i/(a; - 1), 

t— >oo t 

where x is the minimal solution to x — e^+A'(^-i)^ where fi ~ h{t)dt. 
Proof. By Lemma [TUl we have 



(5.7) 



liminf-logE[e^^'] > sup 
> sup 



ex + x-x-x\og 



(a/a) 
(a/a) 



> sup 

0<K<^,(KX,TT)eQ, 



1 log if 

K ^ 



Xtt 



> sup 



l---logX 



Kv 



1 - K^i 



J v{x — 1) if 6* e (— oo, /i — 1 — log ^) 
I +0O otherwise ' 

where x is the minimal solution to x = e^^^*^^"^). 

Let h{t) < hn{t) + h^{t) where /i„(t) = Ez"=i a^e"'''* and ft.,(i) = \h{t) ~ /i„(i)|, 
which tends to zero in both and L°° norms as i — )■ oo. Let Di be the set of 
points generated by the point process with Poisson rate A(^^g^^ ^^(/i„(t — t)) 
and then let D2 be the set of points generated by the point process with Poisson 
rate X{Y,T£Di,T<t - '^)) - HJ^reDurKt ^n{t - r)) and then let Dj be the set 
of points generated by the point process with Poisson rate X{^^^^j-i ^ h(t — 
'''))" A(^^gyj-2 ^ ^^^h{t—T)), for any j > 3. LetMj(t) correspond to the number 
of points in Dj by time t. Therefore, 



(5.8) 

Now, for any N eN, 
(5.9) 

\eEf=iA/,(t)" 



'E~iA/.(t) 



= E 

< E 

< • • 



< 



jMi{t)+fM-i{e)M2it) 



< 



„eAfi(t) + (e^«-i('»)_i)|[^_,i^|[^iMi(t) 



where /j(6l) = (e^^-iW - 1)||/i||li + 6*, for j > 2 and /i(6') = 6I. Thus, for any 
9 < fi — 1 ~ log(^), e^""^'-^-' converges to ?/ as — > 00, where y is the minimal 
solution to y = gS+M(i/-i)^ Hence, 



(5.10) 



lim sup - logE[e^^*] < Tn{p0), 

t->-oo t 
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where p = 1 + y\\h — /inllii- For TnipO), we have 
TnipO) = mt sup < h > 6i 

^^^PO {zi,...,z„)eZ [ U l^^=x" 



a. 



■ r \ Au p9 ^sr^ ^ 
< mt sup <^ h > b. 



= inf sup 



i ~ ^ biC^Zi + (i/ + zi + • • • + z„) (e^'=i - l) 
i=i 

where c* satisfies — fejC* + e^?=i^>''' — 1 + ^,f^ ^. 6i = 0, for each 1 < z < n. By 
some computations, it is not hard to see that Xn = e^^'=i satisfies 



(5.11) x„=exp|p0 + ^^(x„-l)| 

= exp \{l + v\\h- K\\L^)e + {xn - 1) / hn{t)dt 



Since ft,„ — s- /i in norm, it is not hard to see that Tn{p9) converges to the minimal 
solution of .T = e^+ll''ll-f-i (^^1) as n — > oo. □ 

Theorem 13. Assume \{z) = v + z and fi — h(t)dt < oo, /i > and h is 
continuous. Then, {Nt/t G ■) satisfies a large deviation principle with the rate 
function I{x) given by 



(5.12) I{x) = 



oo otherwise 



Proof. For the upper bound, apply Gartner-Ellis theorem. For the lower bound, use 
the tilting method and identify I{x) as the Fenchel-Legendre transform of r{d). □ 



Remark 14. In Bordenave and Torrisi pLj, their I (x) has the form 
(5.13) I{x) 



^^- + ^-17^ z/a;e[0,oo) 
+00 otherwise 



where 9 = 6^ is the unique solution in (—00, /i — 1 — log/^) o/E[e^"^] — a; > 0. 

Here, £[6^"^] satisfies the equation 

(5.14) E[e^^] = exp {Ai(E[e^'5] - 1)} , 

which implies that 9x ~ log (yj;^^^ — M (u^^ ^ ■ Substituting into the formula, 
their rate function is the same as what we got. 
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6. LDP FOR A Special Class of General Hawkes Processes: An 
Approximation Approach 

In this section, we prove the large deviation results for {Nt/t G •) for a very 
special class of nonlinear A(-) and general h{-). 

Let Pn denotes the probability measure under which A^^ follows the Hawkes 
process with exciting function /i„ = such that /i„ — >■ /i as n ^ oo in 

both and norms. Let us define 

(6.1) r„(e)= lim llogE^" [e^^'l. 

We have the following results. 

Lemma 15. For any K > and 9\,92 e [—K,K\, there exists some constant 
C{K) only depending on K such that for any n, 

(6.2) |r„(^i) - r„(^2)| < C{K)\9^ - 92\. 
Proof. Without loss of generality, assume that 62 > 9i such that 

(6.3) r„(^i) < r„(^2) 

= sup / (6*2 - 6'i)A7r + ^iAtt - if (A, tt) 

< sup /(^2-^i)A7r + r„(^i), 
(A,#)eQ« J 



where 
(6.4) 



Q: = |(A,7r)eQe: J ^iA7r-^(A,7r)>r„(^i)-l|. 



The key is to prove that sup^j;^ #)eQ* I — ^i-^) f*^^ some constant C{K) > 
only depending on K. Define u{zi, . . . , z„) = e2^i=i where 



(6-5) Ci = ^ — — • -, l<i<n. 



3K 1 

Define l/ = -4:^i such that 



(6.6) V{zu ...,zn) = ^^n-^ Y.^i- A(^i + • • • + ^„)(e3^ - 1). 

Notice that / Afrr = for any test function / with certain regularities. If we try 

/ = fj-, 1 < » < we get 

(6.7) - j ZiTT + ^ j Xw = 0, l<i<n. 
Summing over 1 < i < n, we get 

(6-8) [Xn = ^j-^[j2^i^- 

J Z^j=l bi J i=i 
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Notice that Y^^=i = ll^nlUi which is approximately when n is large. Since 

limsup2_>.oo = and X]"=i — 0> have 

(6.9) e,Jx7r<Kjx7r=^^jf2zi7r<^Jv7r + a 



1/2 



where Ci/2{K) is some positive constant only depending on K. 

We claim that JV{z)n < H{n) for any tt € Q*. Let us prove it. By Ergodic 
Theorem and Jensen Inequality, we have 
(6.10) 



j V{z)7r - 



lim 

t—>-oo 



V{Z,)ds 



< lim sup - log E'' 

t->CX) t 



/o V(Z,)ds 



eJo 



+ HiTT). 



Next, wc will show that u > 1. That is equivalent to proving that J27=i — ^• 
Consider the process 

(6-11) y^=±^ = i:±t 

2—1 Tj <t 2—1 



ip-bi{t-Tj) 



where g{t) = X;r=i fe"^'*. Notice that g(f) = h{s)ds > 0. Therefore, > 
almost surely and thus F — 0- Since =^ + F = and w > 1, by Feynman-Kac 
Formula and Dynkin's Formula, we have 



(6.12) E" 



J^V{Z,)ds 



u{Zt)e 



= u{Zo) - 
= u{Zo), 

and therefore / V{z)Tt < H{7t) for any n G Ql- Hence, 



{Au{Zs) + V{Zs)u{Zs))e 



ds 



1 



1 



(6.13) e^J XiT<-J V{z) + C,/2{K) <^H + C,/2iK). 
Notice that 

(6.14) - oo < r„(6li) -l<9x j \tt-H < r„(6'i) < oo. 
Hence, we have 



(6.15) 



which implies that H < 2(Ci/2(K) - r„(6'i) + 1). Hence, 

(6.16) Jxn<^J V7r+^C,/2{K) < l(Ci/2(^)-r„(^i) + l) + -^Ci/2(^). 
Finally, notice that sup„ Ij/inlUi < + m for some m > and thus 

(6.17) TM > Tni-K) > ^^^^^^,+^{-K} > ^^^^^^^^{-k), 

where r||;j||^j_|_„ denotes the case when the exciting function is \\h\\i^i + m and 
rii^ll is the case when X{z) = v + z which dominates the original A(-) for i/ > 
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big enough. Moreover, r||,^|| ^^^{^K) is finite since —K < 0. Hence, we conclude 
that 

(6.18) sup / At < C{K), 

for some C{K) > only depending on K. □ 

Lemma 16. Assume that A(-) > c for some c > 0, lim2_j.oo = and A(-)" is 
Lipschitz with constant La for any a > 1. For any K > Q, r„(0) is Cauchy with 9 
uniformly in [—K, K] . 

Proof. Abusing the notation, let us write H{t) = A(^^.^( h{t — Tj)). Observe first 
that, for any q, 

is a martingale under P„. By Holder's inequality, we get 
(6.20) 



rlP 

dPr. 



geWt-/J(A(H„(3))-A(//„(s)))ds-/o'log(i^|^)dAr^ 
\,peNt-p J^^{\(H,^{s))-\(H^(s)))ds 



r^,;„Mog(4<5^)dAr. 



By Cauchy-Schwarz inequality, we get 
(6.21) 



E^ 



1/9 



< 



We also have 
(6.22) 

E^' 



i/p 



^peWt+pLi||/i,„-/i„||^iAft 



gPe7Vt-p/„'(A(i/„(s))-A(H„(s)))ds < 

Therefore, by Lemma [T5l and the fact r„(0) = for any n, we have 
(6.23) 

-r„(^) 



i/p 



P 2g V 



< CiK)Lier, 



C{K) L2qem,n 1 



1 



2q 



„2q-l 



-r„(p(?) - -r„(0)+ 1- - |r„(0)| 



1 



C(JC) L2g£„,„ ^ C(i^)(p-l)/< 
2(7 c29-i p 



C{K)K, 
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where em,n = \\hm — hn\\L^- Hence, we have 

(6.24) nnisup{r„(0) - r„(0)} <2(l--) C{K)K, 

m,n— foo \ P J 

which is true for any p > 1. Letting p i 1, we get the desired result. □ 

Remark 17. // A(-) > c > and lim2^oo = for any a > 0, then, A(-)'^ is 
Lipschitz for any cr > 1. For instance, A(z) — [log(z + c)]^ satisfies the conditions, 
where /? > and c > 1. 

Theorem 18. Assume that A(-) > c for some c > 0, Mmz^aa = and A(-)" is 
Lipschitz with constant La for any a > 1- We have Hmt_5.oo 7 logE[e*"'^'] exists for 
any 9 gM. and 

(6.25) hm - logE[e^^'] = r{e), 

t— >oo t 

where T{9) = hni„^oo r„(6'). 

Proof. By Lemma [T51 we have that T{d) = hm„_j.oo r„(0) exists and r„(6') con- 
verges to the hmit uniformly on any compact set [—K,K]. Since r„(0) is Lip- 
schitz by Lemma 1151 it is continuous and the limit F is also continuous. Let 
En — \\hn ^ ^11 Li l£ As in the proof of Lemma 1161 for ^ny € [—K,K], we 
get 

(6.26) limsupilogE[e^^*] 

t->-oo t 

< r„(0) + C{K)L,e,, + ^ . ^ + 2 (1 - ^) CiK)K. 

Letting n — > 00 first and then p I 1, we get limsupi_^oQ 7 logE[e*^'] < r{d). 
Similarly, 



3.27) T„{B) < liminf — logE[e(P^+P^i'")^'] -Kliminf — logE 
t-i-oo pt t->oo 2qt 



< liminf logEie^P'"^*] + liminf logEie^'f^^^"^*] 
t-i-oo pp't t-i-oo pq't 



lim inf log E 

t-i-oo 2qt 



Since we can dominate A(-) by the linear case X{z) = v + z and under linear case the 
limit of logarithmic moment generating function T^{9) is continuous in 9, letting 
n — > cxD, we have 

(6.28) r{9) < liminf logE[ePP'^^']. 
This holds for any 9 and thus 

1 / 9 

(6.29) liminf - logEfe'^^'l > pp'T — 

t-)-oo t \PP 

Letting p,p' I 1 and using the continuity of r(-), we get the desired result. □ 
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Theorem 19. Assume that A(-) > c for some c > 0, lim^^oo = and A(-)" 
is Lipschitz with constant La for any a > 1. We have {Nt/t G •) satisfies the large 
deviation principle with the rate function 

(6.30) i{x) = sup{fe - r{e)}. 

Proof. For the upper bound, apply Gartner-Ellis Theorem. Let us prove the lower 
bound. Let B^{x) denote the open ball centered at x with radius e > 0. By Holder's 
inequality, we have 



3.31) 



P,r 



t 



e B,{x) < 



dP„. 



Therefore, letting t 

< -^r{pp'Lien) 
PP 



3.32) sup{9x 

Bern 



(X), we have 
1 



lim -logP„ 

t->oo t 



LP I 



t 



e B,{x] 



1/9 



e B,[x) 



1 



L 



2pq' ^ n 



— lim inf — log I 

q t-i-oo t 



2pq' \ c?Pi' 

where e„ = ||/i„ — Hence, letting n — > cxd, we have 

'Nt 



Nt 



e B,{x) 



(6.33) 



— lim inf — log I 

q t^oo t 



e B^{x) ) > limsupsup{6'a; - r„(6')}. 



We know that r„(6') 
(6.34) 



T{9) uniformly on any compact set K, thus 

r„(0)}^ sup{fe-r(0)}, 

e<^K 



sup{0x 



as n — >■ oo for any compact set K. Notice that A(-) > c > and recall that the 
limit for the logarithmic moment generating function with parameter 9 for Poisson 
process with constant rate c is (e^ — l)c. Hence 



(6.35) 

(7 — 

which implies that supgg 
(6.36) ^ 



limmfi:4^>liminfi^ 



-^rOO 

{9x 



r,i(6')} supegR{fe- 



— lim inf — log I 

q t->-oo t 



Nt , . 



> sup{0a; 



r(6i)}. Therefore, 
r(0)}. 



Letting ^ 1, we get the desired result. 



□ 



Remark 20. The class of nonlinear Hawkes process with general exciting function 
h for which we proved the large deviation principle here is unfortunately a big too 
special. It works for the rate function like X{z) = [log(c + 2)]^ for example but does 
not work for A(-) that has sublinear polynomial growth. In fact, by the coupling 
argument we used in the proof of the case of linear A(-) in Theorem WA we can prove 
that in the case when VmVz^oo = and A(-) is a-Lipshcitz and A(-) > c > 0, 
T{9) = lim„_>.oo r„(0) for 9 < fi— 1 — log/z, where fi ~ h{t)dt and F and F„ are 
the limit of logarithmic moment generating functions when the exciting functions 
are h and hn respectively and hn ^ h in . For the linear case, since T{9) — oo 
for 9 > ji — \ — log /i, the coupling argument is good enough. However, for the 
sublinear A(-), T(9) < oo for any 9 and the coupling argument is not enough. In 
fact, it will appear in Zhu [17] that under the condition that limz_j.oo = 0, A(-) 
is positive, increasing, a-Lipshcitz and A(-) > c > and h{-) is positive, decreasing 
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and th{t)dt < oo, there is a level-3 large deviation principle from which we can 
use the contraction principle to get the level- 1 large deviation principle for {Nt/t e 
•). Therefore, we conjecture that in the sublinear case, T{9) = liin„_>.oo r„(0) for 
any 9 and {Nt/t G •) satisfies the large deviation principle with rate function I{x) = 
supgg][j{0x — T(9)}. The advantage of approximating the general case by the case 
when h is a sum of exponentials is that Tn{9) can be evaluated as an optimization 
problem, which might be possibly computable by some numerical schemes. 
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